Inter-transcriber reliability of toBI prosodic labeling
نویسندگان
چکیده
The goal of this study was to evaluate the reliability among transcribers of a standard prosodic labeling system under relatively optimal conditions of training, supervision, facilities, procedures, and extent of speaker familiarity. The ToBI (Tones and Break Indices) model for standard American English[7][1] was used in the study; break indices indicate the degree of junction between words, pitch accents designate word prominence, and edge tones mark phrase boundaries. The American English speech corpora were read by a female professional speaker and by a male professional speaker, and were composed of several types of texts to ensure prosodic variety. Each of four experienced transcribers independently labeled each corpus. For each corpus, word level agreement in break indices, pitch accents, and edge tones between all possible pairs of transcribers was analyzed, and various statistics were calculated. Agreement among labelers was generally higher than that reported in previous studies[6][3] of larger and more diverse groups of labelers. Agreement was high for some prosodic categories, but low for others. The extent of reliability for various prosodic distinctions has important implications for re ning the ToBI model and for limitations in the use of prosody in speech technologies.
منابع مشابه
A comparison of inter-transcriber reliability for two systems of prosodic annotation: rap (rhythm and pitch) and toBI (tones and break indices)
Agreement was investigated among five labelers for the use of two prosodic annotation systems: the ToBI (Tones and Break Indices) system [1,2] and the RaP (Rhythm and Pitch) system [3]. Each system permits the labeling of pitch accents and two levels of phrasal boundaries; RaP also permits labeling of speech rhythm and distinguishes multiple levels of prominence on syllables. After training wit...
متن کاملAnalysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system
A set of tools to analyze inconsistencies observed in a Cat_ToBI labeling experiment are presented. We formalize and use the metrics that are commonly used in inconsistency tests. The metrics are systematically applied to analyze the robustness of every symbol and every pair of transcribers. The results reveal agreement rates for this study that are comparable to previous ToBI inter-reliability...
متن کاملA Comparison of Inter - Transcriber Reliab Annotation : RaP ( Rhythm and Pitch ) and
Agreement was investigated among five labelers for the use of two prosodic annotation systems: the ToBI (Tones and Break Indices) system [1,2] and the RaP (Rhythm and Pitch) system [3]. Each system permits the labeling of pitch accents and two levels of phrasal boundaries; RaP also permits labeling of speech rhythm and distinguishes multiple levels of prominence on syllables. After training wit...
متن کاملInter - transcriber reliability for two systems of prosodic annotation : ToBI ( Tones and Break Indices ) and RaP ( Rhythm and Pitch )
University of Massachusetts Amherst, Michigan State University, Massachusetts Institute of Technology Abstract Speech researchers often rely on human annotation of prosody to generate data to test hypotheses and generate models. We present an overview of two prosodic annotation systems: ToBI (Tones and Break Indices) (Silverman et al., 1992), and RaP (Rhythm and Pitch) (Dilley & Brown, 2005), w...
متن کاملVisualizing tool for evaluating inter-label similarity in prosodic labeling experiments
This paper presents a technique that allows us to detect similarities among prosodic labels used to describe pitch accents within the ToBI framework. The inter-label proximity is determined empirically as a result of the evidence obtained in contingency tables of inter-transcriber agreement tests and in the confusion matrices used in automatic prosodic labeling experiments. This tool may be use...
متن کامل